Outlier grams

Chromagrams


The plot on the left shows the chromagram of “Sweetie Odo” by Juls and Sway Clarke. This was a typical Top Track of mine in 2020. You can clearly see the repetitive chord progression throughout the song –the skips from C to A to G. The sections chromagram clearly shows ‘blocks’ at D and G. The D-blocks represent the chorus vocals and the C-block represents the vocals in the bridge.

Cepstograms


The plot on the left shows the cepstogram of “Sweetie Odo” by Juls and Sway Clarke. This was a typical Top Track of mine in 2020. You can clearly see the intro and outro in c03. It is interesting to see that nothing much happens in c02.

Outlier self-similarity matrices

Just an average girl?

The corpus I am going to analyze consists of four playlists: my Top Tracks of 2019, my Top Tracks of 2020, the Top Tracks NL of 2019 and the Top Tracks NL of 2020. These playlists are respectively representative of the following groups I will be comparing: my taste in music in 2019, my taste in music in 2020, the average Dutch taste in music in 2019, and the average Dutch taste in music in 2020. In this portfolio, I want to find an answer to the following questions:

  1. How did my taste in music in 2020 differ from 2019? (comparison between group 1 and 2)

  2. How did the average Dutch taste in music in 2020 differ from 2019? (comparison between group 3 and 4)

  3. How average was my taste in music in 2019? (comparison between group 1 and 3)

  4. How average was my taste in music in 2020? (comparison between group 2 and 4)

I find it very interesting to analyze these comparisons, especially in light of the coronavirus pandemic. Due to the social distancing measures I did not listen to music in any social setting, such as hanging out with friends, clubbing, or even working out at the gym. My hypothesis is that my taste in music was therefore less average in 2020 than it was in 2019.

Evidently, my corpus is representative for the groups I want to compare, because it actually consists of those groups. However, I do have to remark that a Top Tracks NL playlist might not be representative of ‘the average’. It just contains those tracks that were listened to most often, possibly only within a certain demographic. Whether I belong to this demographic, I cannot say; there seems no information to be found about this anywhere on the internet.

My personal Top Tracks playlists also need a sidenote or two. First, I do not have a premium Spotify account, which means I get a limited amount of skips per hour and most playlists can only be played on shuffle. It might be that a song ended up a Top Track because it was in a playlist I listened to a lot, not because I liked that song so much. However, these limitations only apply when listening to Spotify on my phone. The desktop version of Spotify does allow for infinite skips and the freedom to choose songs manually, put songs in the waiting list, and play a playlist on shuffle or in order.

Moreover, Spotify might be biasing playlists by including certain –possibly sponsored– songs. Also, shuffle might not be completely random, playing popular or sponsored songs first. This way, I would be exposed to more popular songs, which might influence my Top Tracks, possibly making it more average.

A great example is “Stuck with U” by Ariana Grande and Justin Bieber. It is one of my Top Tracks of 2020. It used to be in a lot of different playlists I listened to at the time. And yes, I liked that song, but nevertheless, I am fairly sure there were other songs I liked more in 2020. I definitely consider this song atypical for the group ‘my taste in music in 2020’.

A song I consider very typical for my taste in music in 2019 is “Drive and Disconnect” by Nao. I remember listening to this song on repeat when I discovered it, but also for a longer time after that. And even a few months later I rediscovered this song, and fell in lover all over again. Even now, it is still one of my favorite songs.

While browsing the API Reference, I found the following variables that seem interesting to analyze: genres, artists, popularity, danceability, energy, valence, speechiness, instrumentalness, key, mode, tempo.

How are you feeling?

(No) time to dance

Observations

The variables that actually are interesting to discuss:

valence versus energy

(From a violin plot not shown in this storyboard)

  • Valence: My taste in music has remained fairly constant, although valence was more spread out in My Top Tracks of 2020 than in My Top Tracks of 2019. Interestingly, valence in Top Tracks NL 2019 is very nicely and equally distributed, unlike Top Tracks NL 2020. In fact, Top Tracks NL 2020 seems to have the inverted valence distribution of my Top Tracks playlists.

  • Energy: In both My Top Tracks of 2019 and Top Tracks NL of 2019, there is a bump around 0.7. They also spread out downward quite equally. Top Tracks NL 2020 seems to be the odd one out (again); energy was lower on average and more spread out, though with a small bump higher than the bumps of the other playlists.

tempo versus danceability

  • Tempo in 2019 was pretty much the same in both categories, both spiking at 100 bpm. My 2020 had a higer spike at 105 bpm. NL 2020 is totally different form the rest, with no real spikes, more spread out to lower (90) and higher (120) bpm’s.

(From a violin plot not shown in this storyboard)

  • Danceability: My taste in music has remained somewhat constant, with the bumps of My Top Tracks of 2019 and My Top Tracks of 2020 both at around 0.75, though the overall distribution is more spread out in 2020. Unlike the other plots, this plot shows that, with respect to danceability, playlists are more similar when grouping by year than they are when grouping by category (personal versus NL). Not only the shapes look more similar, the mean dancibilities seem to follow the same trend as well; the mean dancibility was slightly lower in 2020 than in 2019. This could be explained by the fact that, due to social distancing measures, there were less occasions to dance. As a result, artists also made less music to dance to.

mode

(From a bar plot not shown in this storyboard)

  • Mode: In both Top Tracks NL the ratio major versus minor is almost 50/50. My Top Tracks count more minor, with a 40/60 ratio.

The outliers identified


The outlier of My 2019 is “Sober - Stripped” by Raye.

The outliers of 2020 are “So Pra Mim (Acoustic)” by Sarita and “Only Wanna Be With You - Unplugged” by Samm Henshaw.